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PROTEIN EXPRESSION VIA SEED 

SPECIFIC REGULATORY SEQUENCES 

FIELD OF THE INVENTION 
This Invention relates to seed specific expression casecces.for 

5 use In regulation and expression of proteins In plants. 

BACKGROUND OF THE INVENTION 
In oany plant species, seed storage proteins represent the 
predomlnace source oE amino acids and nutrition for the germinating 
plant. It is well known that genes which code for storage proteins 
10 are strictly regulated. Seed storage proteins are produced in large 
quantities at a determined stage of develcptsent; expression Is both 
tissue specific and developmentally controlled. These proteins are 
generally localized in specialized organelles called protein bodies 
or protein storage vacuoles and are essentially protease free . 

IS Because the seeds of many plants can readily synthesize large anounts 
of these storage proteins It Is believed that the biological machin- 
ery within seed cells can be put to use synthesizing proteins of 
importance to pharmacological and industrial needs. Thus, charac- 
terization of the storage proteins as well as the ability to direct 

20 protein synthesis toward the protein bodies is of continuing ex- 
perimental, commercial, and nutritional interest. 

The storage proteins of the major grain crops have been par- 
ticularly veil studied. In the legumes the major seed storage 
proteins are the globulins, represented by two types: the nonglyco- 

25 sylatcd 11 S proteins and the glycosylated 7 S fraction. In the 
common bean (Phaseolus vulgaris) about 50% of the total seed protein 
is represented by a group of three polypeptides (the o, ^ , and 7 
subunlts) collectively known as phaseolin. In soybean (Glycine max) 
the major storage protein in known as j5-conglycinin and its subunlts 

30 (o, q', and fi) represent up to 80% of total seed protein in that 
genus . 

In com (Zea mays), a representative monocot. the protein zeln 
constitutes about 80% of com seed protein. While there are many 
zeln-related genes responsible for zeln production, one type, the ^- 
35 zelns. are represented in very low copy number and yet account for 
10% f total seed protein. 

However, the se ds of many dicot and mon cot plants are defl- 
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clent in certain essential amino acids which require the addition o£ 
these amino acids when these seeds ar used as the predominate s urce 
of prot In in the human or aniaal diet. For a number o£ years 
investigators have attempted to couple the high level expression o£ 
5 seed storage proteins vith recombinant engineering techniques to 
produce plants with enhanced nutritional value. For instance, while 
legume seed contain high levels of storage protein, the protein is 
deficient from a nutritional standpoint in that the globulins are 
relatively low in sulfur- rich amino aclda , making legume seed a 
10 nutr itional ly Incomple te foods tuff . Recombinant techniques could be 
used to increase the amount of sulfur in the seed storage proteins or 
direct synthesis of heterologous sulfur-rich proteins. The develop* 
ment of seed specific promoter cassettes for monocotyledons plant 
species is also important since the seeds from these plants are 
15 widely used as a food source for animals and humans. Thus, in a 
similar manner, zeln, which Is lacking in the essential amino acid 
lysine, can be made nutritionally complete. 

On an industrial scale, such techniques can be used to direct 
heterologous proteins or enzymes into the storage vacuoles. As. 
20 protease- free zones within the seed, such expression would serve as 
an abundant and easily purified source of protein. Thus, plant seed 
could provide an alternative means for producing hormones and enzymes 
needed for commercial and Industrial use. 

By using genetic engineering techniques it is already possible 
25 to transfer genes Into many plant species; direct expression of 
homologous (native to the host) and heterologous (foreign) plant 
genes is well established. Numerous reports indicate that seed 
storage protein genes have been transferred to other plants with 
expression in both a tissue specific and developmen tally regulated 
30 manner, Beachy, et al., EMBO 4:3047-3050. (1985), as well as Into 
protein bodies. Greenwood and Chrlspeels, Plant Physiol. 79:65-71 
(1985). 

The expression of non-plant genes in plants has also met with 
success, but on a more limited level. It has been suggested that the 
35 difficulty In attaining expression of non-plant proteins is due to a 
failure of the plant splicing machinery to recognize introns (also 
called Intervening sequences) present in the non-native transcribed 
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pre-nRHA, K. Hartmuth and A. Barta. Nucleic Acids Res. 14: 7513-28 
( 1986 ) , s Ince removal of incrons Is nec es sary in order for the mRNA 
to be functional. See, H. Green, Annu. Rev. Genet. 20: 671-78 (1986). 
For instance, Barta, et al. Plant Molecular Biol. 6:347-357 (1986), 
5 report construction of a chimeric gene in vhich the complete human 
growth hormone (hCH) gene is flanked by DNA fragments coding for the 
promoter and polyadenylation sites of the nopaline synthase gene. 
When the chimeric gene is introduced into sunflower and tobacco 
callus tissue the gene Is transcribed to the pre-mRNA stage. However, 

10 the transcript is non- functional in the plant tissue as the pre-mRNA 
Is not further processed by the plant to mRNA. Thus the foreign 
protein, hGH, is not synthesized. 

An alternative to direct expression of foreign proteins is to 
make a gene construct wherein the gene encoding for the desired pro- 

15 tein is inserted within a nonfunctional region of a plant protein. 
European Patent Application 319 353 discloses a method for expressing 
a desired protein by modification of the hypervariable region of a 
determined storage protein by an Insert encoding the desired peptide. 
In this way a modified storage protein Is expressed which has 

20 contained within it amino acids which code for the foreign protein. 
The foreign protein Is then cleaved by enzyme digestion, isolated and 
purified. However, the insert must be in reading phase with the non- 
modlfied part of the storage protein on either side of the insert, 
thus effectively limiting the size of the foreign protein which may 

25 be expressed to the length of the hypervariable region of the storage 
protein. 

The present invention overcomes the problems outlined above by 
providing for seed specific expression of a desired protein compris- 
ing fusion of a gene with a seed specific gene regulatory sequence. 
30 INFORMATION DISCLOSURE STATEMENT 

The present Invention uses a gene amplification technique which 
Is a modification of the polymerase chain reaction (PGR) method 
described in U.S. Pat. 4.683,195. 

European Patent Application 255 378 provides nucleotide se- 
35 quences for seed specific transcriptional regulation. However , the 
reference does not teach the translation cassettes of the present 
invention. 
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European Pat nt Application 295 959 refers to DNA expression 
cassettes for transcription of Che brazil nut sulfur-rich 2S seed 
storage protein. The reference neither teaches nor discloses the 
translational expression cassettes of the present invention. 
5 European Patent Application 319 353 refers to a process for the 

production of useful biologically active polypeptides through Che 
modification of plant genes encoding S2 albumins and is discussed 
above . 

An East German Patent Application, DD 263-081-A, reports a 
10 process for the production of useful proteins via a vector containing 
a regulatory sequence isolated from a plant gene. 

Barta. A. , et al. "The Expression of Nopaline Synthase -Human 
Growth Hormone Chlmaeric Gene in Transformed Tobacco and Sunflower 
Callus Tissue", Plant Molecular Biol, 6:347-357 (1986), report 
13 construction of a chimeric gene containing the complete human growth 
hormone (hGH) gene flanked by the promoter and polyadenylation site 
of the nopaline synthase gene and is discussed above. 

Beachy, et al, "Accumulation and Assembly of Soybean ^-Con- 
glycinin in Seeds of Transfonaed Petunia Plants", EHBO. 4:3047-3053 
20 (1985), report the accumulation of the a' -subuntt of the soybean seed 
storage protein ^-conglycinln in the seeds of transformed petunia 
cells. The reference does not teach or disclose the cassettes of the 
present invention. 

P.P. Chee, et al, "Expression of a Bean Storage Protein 
25 'Phaseolin Kinigene' in Foreign Plant Tissues", Gene, 41:47-57 
(1986) , report that stable and functional phaseolin ntRNA can be 
produced in tobacco by the use of an mutant phaseolin gene lacking 
the native Introns . 

The gene coding for the ^-zein 15Kd protein and regulatory 
30 regions are used in this invention. The sequence and isolation of 
this sulfur-rich zein protein is disclosed in Pedersen, eC al, 
"Sequence Analysis and Characterization of a Maize Gene Encoding a 
High-Sulfur Zein Protein of 15,000", J. Biol. Chem. 261:6279-6284 
(1986). 

35 SengupCa-Gopalan, et al, "Developmentally Regulated Expression 

of the Bean ^-Phaseolin Gene in Tobacco Seed", Proc. Natl. Acad. USA, 
82:3320-3324 (1985), report the transformation of tobacco with a 3.8- 
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kB phaseolln DKA fragment containing the coding region and flanking 
DNA necessary for c rr ctly regulated expression of the gene. 
However, the present invention uses specific linkage sequences not 
taught by the reference. 
5 The complete nucleotide sequence of the French bean seed storage 

protein phaseolln is disclosed in Sllghtom, et al, "Complete Nucleo- 
tide Sequence of French Bean Storage Protein Gene: Fhaseolin", Froc. 
Natl. Acad. USA, 80:1897-1901 (1983). Some of the methods reported 
therein are used In the present invention. 
10 SUMMARY OF THE INVENTION 

The present invention provides a seed specific expression 
cassette which has a promoter derived from either phaseolln, a' - 
subunit of ^-conglyclnin, or ^-zeln 15Kd; a translation initiation 
signal from either phaseolln, the a'-siibunlt of ^-conglycinln, ^-zein 
15 15Kd. or an animal gene; a gene derived from brazil nut sulfur-rich 
seed storage protein or an animal gene; a translation termination 
region derived from either the animal gene, phaseolln. or ^-zein 
15Kd; and at least one polyadenylatlon region from phaseolln, the 
animal gene, or >9-zein 15Kd; wherein the regulatory sequences are 
20 operably linked to one another in such a manner that the gene is 
expressed in seed or seed storage protein bodies. The invention 
further provides enzyme restriction sites incorporated Into the 
sequences which link the regulatory elements and genes. The inven- 
tion also provides ^hat cert:aln of the regulatory elements be derived 
25 from the same genetic source. 

More particularly, the present Invention provides an expression 
cassette in which the promoter is from phaseolln or a'-subtmlt of p- 
conglycinin, the animal gene and remaining elements are from bovine 
somato- tropin, and the restriction sites contained within the linkers 
30 are Ncol and Hindlll. 

More particularly, the present invention provides an expression 
cassette In which the promoter is from phaseolin or a' -subunit of fi- 
conglycinin, one poly-A signal is from phaseolin, the remaining 
regions, including a second poly-A site, are from bovine somatotropin 
35 and the restriction sites are Ncol and Hindlll. 

Most particularly, the present Invention provides an expression 
cassette for the expression and regulation of bovine somatotropin in 
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seed storage bodies. 

In another embodiment of the Invention the gene Is from brazil 
nut 2S storage protein or subunits thereof, the promoter and tran- 
slation initiation signal are derived from the same genetic source, 
5 and the translation termination signal and poly-A region are, from 
phaseolin or ^-zeln 15Kd. 

The Invention also provides for cells , plants , and seeds 
containing the expression cassettes disclosed here. 

DETAILED DESCRIPTION OF THE INVENTIOS 
10 The present Invention provides a plant In which a gene coding 

for a specified protein Is Incorporated Into the plant's genome in a 
manner such that regulation and expression of the specified protein 
is under the control of a seed specific regulatory sequence . The 
protein vhlch is of interest and for which expression Is desired may 
15 be a plant protein homologous or heterologous to the host plant, or 
on the other hand, may be normally produced In a virus, micro- 
organism, or other animal tissue. 

The following examples Illustrate specific embodiments of the 
Invention. Ue have used a modified form of the polymerase chain 
20 reaction technique to combine various seed storage protein gene 

regulatory elements (promoters, translation Initiation sites, 
translation termination sites, and transcription polyadenylation 
(poly-A) signals) with protein coding regions to form expression 
cassettes vhlch direct and control the production of various modified 
25 proteins in the seeds of plants. These expression cassettes nay then 
be used to transform plant cells from which whole plants are derived. 
Gene regulatory elements are used which may be Inserted into hosts of 
either mohocot or dlcot plant species. Of particular Interest as 
host plants are those which are cultivated for high levels of seed 
3D production. Such hosts include wheat, com, rice, etc. (monocots) 
and soybean, common bean, pea, etc. (dlcots) . Com and soybean 
represent ideal hosts and seed storage protein regulatory regions 
Isolated from Phaseolus vulgaris (phaseolin gene), Glycine max 
(promoter from the a' -subunit of ^-conglycinin gene) and Zea mays 
35 (zein 15 Kd peptide gene) are preferred as sources of regulatory 
regions. Proteins which would have value when expressed in plant 
seeds includ those designed for nutritional Improvement (e.g., high 
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sulfur soybean and cotomon bean, high lysine com, etc. ) , pharm- 
acological iaporCance (enzymes and hormones) , and industrial value 
(enzymes). Ve have demonstrat d seed specific expression cassettes 
containing genes designed to improve the nutritional value of seed 
5 protein meal by encoding a high-sulfur gene. U have also demonstra- 
ted the production and segregation of bovine somatatropin (Bst) 
protein into protein storage vacuoles. 

Numerous cloning vectors are available which have an appropriate 
restriction site, a replication system, and a marker. As a cloning 

10 vector we have selected pUClB, Norrander et al. Gene (1983)2fi: 101- 
106, containing an amplclllln resistance marker. Other suitable 
vectors are veil knovn and may be easily adapted for use with the 
specific restriction sites of the present expression cassettes. 

For the genetic engineering experiments described in the 

15 examples one can use many different methods of obtaining the desired 
sequence junctions between regulatory regions and coding DNA, such 
as t site specific mutagenesis , linkers , and/or polymerase chain 
reactions (PGR). Ve have selected to use a modification of PGR, 
which we call custom polymerase chain reaction (CPCR) as the technl- 

20 que for the addition of regulatory regions with coding DNA. CPCR 
allows one to incorporate specific enzyme restriction sites directly 
into the linkage oligomers as they are synthesized. The oligomers 
are then used in PGR amplification of a desired gene or regulatory 
element. The advantage to this modification is a more rapid eon- 

25 struction of expression cassettes from the amplified sequences. 

We include in the following examples expression cassettes for 
transcriptional and trans latlonal fusion; two dlcot specific seed 
promoter cassettes; and one monocot specific seed promoter cassette. 
Those skilled in the art will recognize that these cassettes may be 

30 utilized in a variety of hosts. In addition, the incorporation of 
these cassettes into a host plant genome may be accomplished by a 
variety of conventional methods. Such methods Include transformation 
with Ti-DNA (fi^ tumef aciens . rhlzogenes . etc.), protoplast fusion, 
injection, electroporation, and mlcroprojectile bombardment. These 

35 plant seed expression cassettes are xised to express a high sulfur 
seed storage protein gene In the seeds of both dlcot and monocot 
plants and to express a non-plant gene, bovine somatotropin (Bst). in 
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che seeds of dlcot plants. 

Transgenic plants containing Ch heterol gous gen of inter st 
are obtained through the Rl generation. S eds fr a these plants, R2 
seeds, are stained with protein antibody f r TEK visualization f the 
5 heterologous protein localized within the protein storage bodies.. 

The following abbreviations are used throughout this disclosure 
to denote gene regulatory elements: 
P - Promoter 

SP - Translation Initiation Signal or Signal Peptide 
10 TT - Translation Termination Signal 

S - Polyadenylatlon (Poly-A) Signal 
The following abbreviations are used throughout this disclosure 
to indicate source of the gene regulatory elements: 
bv - genus Bovine 
15 bz - Bertholletla excelsia (Brazil nut) 

gm - Glycine max 
pv - Phaseolus vulgaris 
zm - Zea mays 

The following abbreviations are used throughout this disclosure 
20 to indicate proteins: 

"Bst* means bovine somatotropin protein. "BST" means the 
gene encoding Bst. 

"Bz" means a sulfur-rich protein isolated from Brazil nut 
seed. "Bz-1*, hz-2* , and "Bz-S* are subunits of Bz. These proteins 
25 are described in further detail below. "BZ". "BZ-l", •'BZ-2". and "BZ- 
3" are the genes encoding the respective proteins. 

The following terms are used throughout this disclosure and are 
defined as follows: 

"Seed specific expression cassette" means a DNA construct 
30 containing gene regulatory elements and a gene encoding a protein of 
interest, in a suitable vector for incorporation into a plant genome, 
such that the expression of the gene is under the control of the gene 
regulatory regions, the regions being of the specificity, orienta- 
tion, and reading frame such that expression of the gene is in the 
35 plant seed. 

"Expression cassette" means a DNA construct lacking at least a 
gene encoding a protein of interest. 
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"Oligomer primer* means a nucleotide sequence needed for FCR 
amplification of a gene or gene r gulatory element. The primers used 
in this invention have been synthesized to Incorporate specific 
restriction enzyme sites Into the primer. 
5 Charts 1 through 22 Illustrate the constructions of the present 

invention. The following conventions are used to Illustrate plasmlds 
and DNA fragments : 

(1) The single line figures represent both circular and linear 
double-stranded DNA. 
10 (2) Asterisks (*> indicate that the molecule represented Is 

circular. Lack of an asterisk Indicates the molecule Is linear. 

(3) Endonuclease restriction sites are indicated above the line. 

(4) Genes are indicated below the line. 

(5) Distance between genes and restriction sites are not to 
15 scale. The charts show their respective positions only. 

The methods used here employ many standard recombinant DHA 
procedures as well as those for creating transgenic plants and are 
well known by those skilled In the art. Standard methods and laborat- 
ory techniques for recombinant experimentation can be found In T. 
20 Haniatls et al. . 'Molecular Cloning; A Laboratory Manual" , Cold 
Spring Harbor Laboratory (1982) (recombinant DHA experimental protoc- 
ols and methods); Killer. "Experiments In Molecular Genetics", Cold 
Spring Harbor Laboratory (1972) (culture techniques); Southern, J. 
Mol. Biol. 503 (1975) (clone verification): Sanger, et al, Proc. 

25 Natl. Acad. Scl. USA 2^:5463 (1977) (DNA sequencing). S.B. Gelvln and 
R.A. Schllperoot. 'Plant Molecular Biology' (1988), supplies techni- 
ques for use in transforming and cloning plants > 

DESCRIPTION OF THE PREFERKED EMBODIMENTS 
EXAMPLE 1 

30 CONSTRUCTION OF AN EXPRESSION CASSETTE CONTAINING THE 

PHASEOLIN PROMOTER (pvP) AND POLYADENYIATION SIGNAL (pvS) 

FOR TRANSCRIPTION FUSION TO CODING DNA 
The major seed storage protein gene of Phaseolus vulgaris 
(common bean) Is known as phaseolln. The complete sequence, Includ- 
35 ing 5' and 3' regulatory elements, and cDNA counterpart have been 
published by Slightom et al. (1983). As a source of phaseolln 
regulatory regions we use the clone pPhasmin which has the Intronless 
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cDHA counterpart In place of the g nomic coding and Intxon regions, 
as described by Cramer et al. . Pre. Natl. Acad. USA, 82:334-338 
(1985) . Oligomer primers for amplification of phaseolin promoter 
(pvP) are constructed to contain Hindlll (In the 5' primer) and Kcol 
5 (in the 3' primer) sites. The Kcol site is designed to Include a 
translation initiation site which conforms to the rules described by 
Kozak (1986) by containing the sequence AAXXATGG where X represents 
G, A, T or C. Sequences of the oligomers used for the amplification 
and engineering of the phaseolin promoter are shown in Chart 1(a). 
10 Using CPCR technology Che restriction sites are fused to the 500 b.p. 
phaseolin promoter region resulting In the fragment designated pvP in 
Chart Kb). 

In a similar manner oligomer prlners are synthesized to flank 
the phaseolin polyadenylatlon (poiy-A) signal. In this case, however, 

15 the 5* primer Incorporates a Hcol site and the 3' primer a Hindlll 
slte> The nucleotide sequences of these two primers is shown in Chart 
1(c). Amplification results in a 1100 b.p. fragment designated pvS in 
Chart 1(d). The pvP and pvS fragments were subjected to digestion by 
NcoJ. followed by ligation of the fragments (Chart 1(e)). The ligated 

20 pvP + pvS fragment (1600 b.p.) is subjected to ifindlll digestion, 
purified on polyacryl amide gel, and cloned into fflndlll digested 
pUC18 . The clone containing the correct pvP + pvS sequences (con- 
firmed by restriction enzyme mapping and nucleotide sequencing) is 
referred to as pUClSpvPpvS in Chart 1(f). This expression cassette 

25 is used with coding regions which either contain their own transla- 
tion initiation signal or do not need a signal for proper targeting 
of the protein product. in addition, any coding regions inserted 
into this cassette must include a translation termination signal. 

EXAMPLE 2 

30 CONSTRUCTION OF A PHASEOLIN EXPRESSION CASSETTE CONTAINING 

PHASEOLIN PROMOTOR (pvP) , SIGNAL PEPTIDE (pvSP) , 
TRANSLATION TERMINATOR (pvTT) . AND POLY-A SIGNAL (pvS) 
FOR TRANSIATIONAL FUSIONS 
In this example oligonucleotide primers are designed for the 
35 amplification of promoter and terminator fragments from pFhasmin 
which contain the phaseolin signal peptide (SP) and translational 
termination (TT) signals, respectively. N- terminal analyses of 



BNSDOCID: <mO 91l3993AlJj' 



wo 91/13993 



-11- 



PCr/US91/00887 



phaseolln protein by Parran et al., Phytoch mlstry 26: 335 (1987), 
show that cleavage is ragged and occurs aft r aialno acid residue 23 
and before amino acid 27. Thus, to ensure that the cassette contains 
the anino acids necessary for sigrial peptide cleavag the cassette is 
5 designed to include through r sidue 31. F r CPCR amplification of 
the pvPpvSP fragment oligomers are designed which contain an inframa 
5' /findlll sice and an Inframe 3' Ncol site; these sequences are 
shown in Chart 2Ca). Amplification yields the 580 b.p. fragment 
designated pvPpvSP in Chart 2(b). 
10 To ensure complete reading of the translation termination 

sequence, the 4 amino acids S' of the translation termination codon 
are retained. These phaseolln 3' regulatory regions are obtained 
using the primers shown in Chart 2(c). Again, these primers contain 
an inframe Nco I site in the 5' oligomer and a Hlndlll site in the 3' 
15 oligomer. Amplification yields a 1200 b.p. fragment referred to as 
pvTTpvS in chart 2(d). 

Both fragments, pvPpvSP and pvTTpvS. are subjected to digestion 
by Ncol and the exposed Ncol sites are llgated together to yield the 
fragment referred to as pvPpvSPpvTTpvS in chart 2(e). This fragment 
20 is then digested with Hindlll to expose CPCR engineered 5' and 3' 
sites followed by cloning into ifXndlll digested pUC18. The phaseolln 
expression cassette referred to as pUClSpvPpvSFpvTTpvS In chart 2(f) 
is obtained and checked by restriction enzyme mapping and nucleotide 
sequencing to determine that it is correct. 
25 .This seed specific expression cassette is used with coding 

regions which need a seed specific signal peptide and an inframe 
translation termination region. 

EXAMPLE 3 

CONSTRUCTIOH OF AN EXPRESSION CASSETTE CONTAINING 
30 THE PROMOTER REGION OF THE GLYCINE MAX SEED STORAGE PROTEIN 

q'-SUBUNIT of ^-CONGLYCININ (gmP) AND PHASEOLIN POLY-A SIGNAL (pvS) 
In soybean (Glycine sp.) ^-conglycinin represents the major seed 
storage protein. The level of expression is very high; each of the 
two to four subunits account for between 12 to 20% of the total 
35 protein in the seed. The nucleotide sequence of the promoter region 
of Glycine max a' -subunit of j^-conglycinin has been published by 
Doyle tel., J, Biol. Chem. 261:9228 (1986). 
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The promoter (gmP) Is amplified from 250 ng t Cal Glycine wax 
DNA and, by using CPCR, & HLndlll siCe Is added t Che 5' end as well 
as an Ncol site at the translation initiator codon (ATG> . Using the 
published sequence oligomers are designed for amplification of the 
5 promoter (oligomer sequences are shown in Chart 3(a)), The sequence 
of the amplified 600 b.p. gene, designated gmf in Chart 3(b), is 
verified by restriction eiazyme mapping and nucleotide sequencing for 
accuracy . 

The fra^ent gmP was subjected to digestion by Ncol and ligated 
10 to Che Ncol digested fragment pvS (described in Example 1) to obtain 
the fragment re f e rr ed to as gmPpvS In CHtart 3(c). The resulting 
fragment , gmPpvS , is dige s ted wl th Hindi Z 1 , 11 gated into Hindlll 
digested pUClS, and the clone pUClBgmPpvS is obtained. (See Chart 
3(d)). This expression cassette Is used to direct seed specific 
15 expression of coding regions which require transcriptional fusions, 
similar to that shown In Example 1« However, when incorporated into 
the Glycine genome the level of expression nay be somewhat higher 
than the Phaseolus expression cassette of Example 1, due to the more 
native environment for the Glycine cassette. 
20 EXAMPLE 4 

CONSTRUCTION OF AN EXPRESSION CASSETTE CONTAINING THE 
GLYCIHE HAX tt'-SUBUNlT OF ^-CONGLYCINIH PROMOTER (gmP) AND 
SIGNAL PEPTIDE (gmSP) REGIONS AND THE FHASEOLIK TRANSLATION 
TERMINATION (pvTT) SIGNAL AND POLYADENYIATION (pvS) SIGNALS. 
25 In this example ve describe construction of an expression 

cassette and plasrald which is used for the expression of modified 
seed storage protein genes or heterologous genes In the seeds of 
dicotyledonous plants. Oligomers are used to amplify the promoter and 
signal peptide region from Glycine max genomic DNA. For the oligomer 
30 which is read 5' -* 3' the same 5' primer as described in Example 3 is 
used. To ensure complete reading of the signal peptide, the 3' 
oligomer primer is designed to hybridize just beyond che signal 
peptide cleavage point described by Doyle eC al, (1985), which occurs 
between amino acid residues 25 and 26. Therefore, we have selected 
35 to add the Ncol site after amino acid residue 29. The primers used 
for this amplification are shown in Chart 4(a) . 

The result of this amplification, ■ referred to as gmPgmSP in 
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Chart 4<b) , Is a fragment of 690 b.p. Authenticity of this fragment 
is checked by nucleotide sequencing . It is then subjected to 
digestion by Ncol and ligated to the Ncol cut fragment pvTTpvS 
described In Example 2 . The resulting fragment , referred to as 
5 gmPgmSPpvTTpvS in Chart 4(c), was subjected to digestion by Hindlll 
and cloned into Hlndlll digested pUC18. The clone (designated 
pUClSgmPgmSPpvTTpvS in C^art 4(d)) is isolated and its structure 
checked by restriction enzyme mapping and nucleotide sequencing. 
This expression cassette is used for the expression of modified seed 
10 storage protein genes or foreign genes In the seeds of dicotyledonous 
plants . 

EXAMPLE S 

CONSTRUCTION OF AN EXPRESSION CASSETTE CONTAINING 
THE PROMOTER (zmP) AND POLYADENYIATION SIGNAL (zmS) FROM 

15 THE ZEA MAYS 15 kDAL SEED STORAGE PROTEIN GENE, 

The zein proteins constitute about 80% of the total seed protein 
of Zea mays (com); however, there are many zein-related genes 
responsible for the production of these zein proteins. Recently, one 
particular member of the zein gene families was found to be represen- 

20 ted in low copy number , but responsible for the production of about 
10% of the total zein proteins. These genes are referred as ^-zelns 
(or 15 Kd) genes and the sequence of one meinber has been described by 
Federsen et al. (1986). However, the nucleotide sequence of the ^- 
zein gene reported by Pedersen only extends about 233 b.p. 5' of the 

25 translation initiation, thus the complete ^-zein gene promoter may 
not be present. To obtain more of the ^-zeln promoter we isolated a 
second gene, referred to as pZeinl5K, and our nucleotide sequence 
analysis (Chart S) shows that this clone contains an additional 110 
b.p. of 5' flanking DNA, giving a total promoter region of about 350 

30 b.p. to the translation initiation codon (see Chart 5). Analysis of 
many zein type genes has shown that sequences responsible for con- 
trolling the developmental expression of seed specific genes are 
located within these 350 b.p. (see review by Thompson and Larkins, 
BioEssays, 10:109-113 1989). 

35 Clone pZeinlSK (Chart 6Ca) ) provides the starting material for 

the CPCR suDplif ication of its promoter and polyadenylation regions. 
However, prior to amplification an indigenous NcoX site in the 
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promoCer region was removed by flushing with mung bean nucleas 
following the procedure of Maniatis» et al. (1982), which caused a 
loss of four base pairs (CATG) . These four base pairs are located 
between the CCAAAT and TATAA elements and thus should not affect the 
5 function of the promoter. This modified ;9-zeln gene promoter region 
was amplified using CPCR to add a Sail site and a Ncol site to the 5' 
and 3' primers, respectiveiy . The sequences of Che oligomers used to 
accomplish this are shown in Chart 6(b). The amplified product was a 
350 b.p. fragment which is referred to as zmP in Chart 6(c). Addi- 

10 tional CPCR oligomer primers were used to amplify the poly adenylat ion 
region of pZein 15K so as to add a <5') ffcol site and a (3') Sail 
site. Oligomers sequences are shown in Chart: 6(d). The anplified 
product was a fragment of approximately 700 b.p. which is referred to 
as zmS in Chart 6(e). The two amplified fragments, zaP and zmS, are 

IS subjected to digestion with Ncol followed by ligation and isolation 
of the fragment referred to as ziaFzmS in Chart 6 ( f ) . zmFzmS Is 
digested with Sail and cloned into Sail digested pTJClS. Clone 
pUClSzmFzmS (Chart 6(g)) is isolated and its structure verified using 
restriction enzyme mapping and nucleotide sequencing. The expression 

20 cassette pUClBzmPzmS is used for seed specific expression of modified 
seed storage protein genes or other genes which do not require a zeln 
signal peptide and/or a zeln translation terminator. 

EXAMPLE 6 

CONSTRUCTION OF A KOKOCOT SEED SPECIFIC EXPRESSION CASSETTE 
25 COKTAINING THE ^-ZEIH FROKOTER (zmP) , SIGNAL PEPTIDE (znSP) , 

TRANSLATION TERMINATOR (zmTT) AND POLYADENYLATION SIGNAL (zmS) . 
This expression cassette Is similar to that in Example S, with 
the addition of the ^-zein signal peptide and translation terminator 
signals. As in Example 5, CPCR adds a 5* Sail site and a 3' inframe 
30 Ncol site. Since in this case, however, the signal peptide cleavage 
point is located between amino acid residues 20 and 21. we have 
included amino acid sequences up to and including residue 25. Thus, 
the Ncol site is placed 3' of this residue. The oligomer primers 
used to engineer and amplify this zeln promoter and signal peptide 
35 fragment are shown in Chart 7(a). This amplification results In a 
400 b.p. fragment referred to as zmPzmSP in chart 7(b). 

In a similar manner, CPCR amplification of the ^-zeln transla* 
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tlon terminator and polyadenylation signal regions of pZcin 15K is 
done using a linker engineered to contain a 5' Inframe Ncol site and 
a 3 ' Sail site; sequences are shown In Chart 7(c) . Amplification 
results in a 720 b.p. fragment referred to as zmTTzmS in Chart 7(d). 
5 The fragments zmPzmSP and zmTTzmS are subjected to digestion by Hcol 
and ligated together to obtain the fragment referred to 
zmFzmSFzmTTzmS in Chart 7(e). This fragment Is then subj ected to 
digestion by 5aII and cloned Into Sail digested pUC18. The resulting 
clone, pUClSzmPzmSPzmTTzmS (Chart 7(f)), is confirmed by restriction 
10 enzyme site mapping and nucleotide sequencing. This seed specific 
expression cassette pUCzmFzmSpzmTTzmS Is used to regulate the 
expression of modified seed storage protein genes or other genes 
which may require a monocot or ^-zeln signal peptide and Inframe 
translation terminator signal. 
15 EXAMPLE 7 

THASEOWS VULGARIS, GLYCINE MAX, AND ZEA HAYS 
SEED SPECIFIC EXPRESSION CASSETTES FOR THE EXPRESSION OF 
BRAZIL NUT 2S SULFUR-RICH SEED STORAGE PROTEIN GENE. 
This example illustrates the use of the seed specific expression 
20 cassettes designed in Examples 1^ 3, and 5 for the expression of a 
sulfur-rlch (26% methionine + cystine) seed storage protein Isolated 
from Brazil nut. The isolation and nucleotide sequence of this 
Brazil nut gene has been reported by Altenbach et al. , Plant Hoi. 
Biol. 8:239-250 (1987). and the cDNA clone referred to as pHS-3 
25 contains all but 25 b.p. of the 5' -untranslated region, all of the 
coding OHA and the complete 3' -untranslated region, as illustrated in 
Chart 8. Because the gene encodes a protein which Is normally found 
in a plant seed environment it Is believed that its stable expression 
and proper transportation to seed storage protein bodied are charac* 
30 terlstics which are inherent to Its structure. However, an analysis 
of the sequence reveals a second Ncol site located near the transla- 
tion termination codon. Thus for the ease of engineering we elimina- 
ted this Wcol site by using CPGR to replace the native Ncol site with 
an inframe Ncol site , This results In the second G being replaced 
35 with a C nucleo tide and removes the Internal Ncol site. The Ncol 
site added via CPCR is used to clone this region Into an expression 
cassette which has an Inframe translation termination codon. For 
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this conscruction w use the promoter regions described In Examples 
1, 3, and 5 and translation temtination and polyadenylation signals 
described In Examples 2, 4, and 6. 

CPCR amplification of the Brazil nut gene added vaa used to 
5 engineer flanking ^coX sites; the 5' Ncol site is added to take 
advantage of the naturally occurring sites located at the Brazil nut 
translational Initiation cbdon (ATG) , and the second Ncol site is 
removed and replaced by a Ncol site as described above. A further 
advantage of this modification is the addition of the amino acid 
10 tryptophan to the gene. Tryptophane is also deficient in the 
proteins of many plant seeds thus its addition results in a more 
nutritionally complete seed protein. The primers for the amplifica- 
tion of the Brazil nut coding region are sbovn in Chart 9<a>. 
Amplification yields a 420 b.p. fragment which is referred to as BZ 
15 In Chart 9Cb}. This frag^nent is subjected to digestion vlth KcoX and 
inserted into the expression cassettes described earlier. The 
following clones are isolated, pUClBpvPBZpvTTpvS, pUC18gmPB2pvTTpvS , 
and pUClSzmPSZzmTTznS , and their structures are checked using 
restriction enzyme mapping and nucleotide sequencing. The structure 
20 of these clones are shown in Charts 9(c) (1), (11) » and (111). These 
cassettes nay then be removed and placed Into an appropriate vector 
for transfer into plants . These transfer vectors nay be used In 
either the ^robacteriuza systems or in systems which utilize micro* 
projectile boinbardment of plant tissues. 
25 EXAMPLE 8 

USE OF PHASEOWS VULGARIS, GtYClUE MAX, AND Z£A HAYS 
SEED SPECIFIC EXPRESSION CASSETTES FOR THE EXPRESSION OF A 

SYNTHETIC SULFUR-RICH SEED STORAGE PROTEIN GENE. 
Because seeds from plants are deficient in more than one amino 
30 acid and because no single seed storage protein gene has been found 
which satisfies all of these needs , it is most likely that these 
needs will be accomplished via a synthetic coding region. To illu- 
strate this possibility three oligomers are synthesized (shown in 
Chart 10) representing portions of the Brazil nut coding DNA. As 
35 shown in Chart 10(a), the first oligomer encodes the major portion of 
the 9 kD subunic (BZ-1) , which contains 35% sulfur amino acids for 
the 48 amino acid peptide. Chart 10(b) shows a second oligomer con- 
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slsting of the complete 9 kD subunit (BZ-2) which encodes 70 amino 
acids containing 28.5% sulfur amino acids. Th third oligomer, Chart 
10(c), is the complece 12 kD subunit (BZ-3) which encodes 93 amino 
acids containing 28% sulfur amino acids. BZ-1 was synthesized as a 
5 single 144 base oligomer, SZ-2 as a 210 base oligomer and BZ-3 as a 
279 base oligomer. These oligomers were synthesized by utilizing the 
published sequence for tHe BZ gene and were completed on an Applied 
Biosystems, Inc. Model 3S0A DNA Synthesizer. The brazilnut gene 
fragments were complemented, purified, and had Vcol sites added on 
10 both 5' and 3' ends using CPCR amplification. As is true of the BZ 
gene in Example 7, each inframe iVcoI site yields the codon TGG which 
encodes tryptophan, making resulting plant seed a more nutritionally 
complete foodstuff. 

The primers used for amplification of BZ-1 are shown in Chart 
15 11(a), those for BZ-2 in Chart 11(b), and those for BZ-3 In Chart 
11(c) , Because of the addition of the extra nucleotides the frag- 
ments isolated after amplification are about 15 b,p. larger than the 
original synthesized fragments BZ-1, BZ-2, and BZ-3. These fragments 
are subjected to digestion with Ncol and cloned Into the Ncol cut 
20 expression cassettes described In Examples 2» 4 and 6. The following 
clones are isolated, pUC18pvPpvSPBZ-l , -2, and -3pvTTpvS (Charts 
12(a)-(c)). pUC18gmPgmSPBZ-l. -2. and -3pvTTpvS (Charts 13(a)-(c)), 
pUClSzmPzmSPBZ-l, -2, and -3pvTTpvS (Charts lACa)-(c)). The struc- 
ture of these clones is checked by restriction enzyme mapping and 
25 nucleotide sequencing. The expression cassettes contained within 
these plasmids may removed and transferred into vectors used for 
Agrobacteriua systems or may used with microproj ectlle bombardment of 
plant tissues. 

EXAMPLE 9 

30 SEED SPECIFIC EXPRESSION CASSETTES FOR 

THE EXPRESSION OF A FOREIGN GENE. 
In this example we have selected to use the seed expression 
cassettes described above for the engineering of vectors capable of 
expressing the bovine somatotropin gene (Bst) ; this gene is also 
35 referred to as the bovine growth hormone (Bgh) gene. The sequence of 
the Bst gene has been determined by Uoychik et al. , Huclelc Acids 
Research, 10:7197-7210 (1982). however, this sequence is not accep- 
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table for express! n in a plant: system due to the presence of U 
introns in the native gene (Barta et al. 1986) . To avoid this 
problem ve use an intronless (minigene) construction of the Kst gene 
obtained from Professor Fritz Rottnan, Department of Biology and 
5 Microbiology, Case Western Reserve University, Cleveland OH. This 
clone is referred to as pSVGH-1 (delta IVS) and is used to provide 
the Bst coding region for CPCR engineering. As is clear fron^ the 
prior examples, the use of CPCR allows the coding region to be en- 
gineered in such a manner to be accepted by any of the expression 
10 cassettes described in Examples 1 to 6. These arrangements can 
include the use of different gene regulatory regions which can be 
provided by the expression cassette or even from the Bst gene. 

Bst protein is encoded in a gene which contains a 26 amino acid 
signal peptide followed by a coding region containing 191 amino 
IS acids. The clone pSVGK-1 also contains the indigenous Bst gene 
poly adenylat ion signal. For the Bst protein to be active its signal 
peptide must be removed, thus by using CPCR In combination with the 
cassettes described above we have the option of expressing it with or 
without its signal peptide, with or without a seed protein signal 
20 peptide, and with or without its polyadenylation signal. Thus many 
different construction can be derived for the expression of Bst in 
plant seeds. This example illustrates several of these expression 
cassettes coding foe Bst. 

The first series of constructions uses the seed storage protein 
25 gene expression promoters, described in Exaaples 1, 3 and 5, to 
construct transcriptional fusions genes which use the Bst signal 
peptide and polyadenylation signal. Using the clone pSVGH (delta 
IVS) as starting material, CPCR is used to add an ffeol site to its 5' 
end (retaining the translation Initiation signal) and adding a 
30 HindUl site 3' of the Bst polyadenylation signal. The oligomer 
primers of this construction are shown in Chart 15(a) and are used to 
amplify a 840 b.p. fragmenC referred to as bvSPSSTbvTTbvS . This 
fragment is subjected to digestion by ycol and cloned into the seed 
promoter fragments described in Examples 1 and 3, see Charts lS(b) 
35 and 16(a) . These ligated seed promoter-Bst fragments are then 
digested with Kindlll followed by ligation to /Tindlll digested pUC18 
and the following clones isolated^ pUClSpvFbvSPBSTbvTTbvS and 
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pUCl3gmPbvSF£5TbvTTbvS , as sh vn In Charts X5(c) and 16(b), respec- 
tively. The correct sequence is verified using restriction enzyme 
site napping and nucleotide sequencing. The inserts from these 
clones can be removed and transferred into either an Agrobaccecium 
5 vector system or into a vector to be used for micropcojectile 
bombardment of plant tissues. 

To avoid the possibility of having an unstable mRKA, product, the 
fist gene can be cloned directly Into the Ncol site located within the 
seed expression cassettes shown in Examples X and 3 . To accomplish 
10 this only the 3' oligomer primer needs to be changed to Include Ncol. 
This is done with the use of the oligomer primer shown In Chart 17(a) 
and CFCR engineering. The fragment containing the Bst gene obtained 
from this amplification is also about 840 b.p. in length, and is 
referred to as bvSPBSTbvTTbvS in Chart 17 (b) . This fragment is 
15 digested with Hcol and cloned Into the Scol site of the plant 
expression cassette shown in Example 1. From this ligation, a clone 
designated pUClBpvPbvSPBSrbvTTbvSpvS is obtained and is shown in 
Charts 17(c) . In a similar manner, the expression cassette of 
Example 3 is inserted into bvSPBSTbvTTbvS (Chart 18(a) > resulting in 
20 the plasmld pUClSgmPbvSPBSTbvTTbvSpvS of Chart 18(b). 

Additionally, by the use of another 3' oligomer primer a similar 
construction is derived which does not include the Bst polyadenyla- 
tion signal. The 3' primer used to obtain the Bst such a fragment 
without its polyadenylation signal is shown in Chart 19(a). Ampllfi- 
25 cation yields a fragment of about 650 b.p., referred to as 
pvSPBstbvTT In Chart 19(b). This is subjected to digestion by Vcol 
and cloned into the plant expression vector shown in Example 1, 
resulting in clone pUClSpvPbvSPBSTbvTTpvS (Chart 19(c)). In addition, 
this Bst fragment (Chart 20(a)) is ligated into the vector of Example 
30 3 which results in clone pUClSgroPbvSPBSTbvTTpvS , shown in Chart 
20(b). 

EXAMPLE 10 
SEED SPECIFIC EXPRESSION CASSETTES FOR 
THE EXPRESSION OF A FOREIGN GENE. 
35 In 1 ts ac 1 1 ve f o r m , fist do e s no t Inc lude its 19 am Ino ac 1 d 

signal peptide, which is cleaved soon after synthesis (Voychlk et 
al., 1982). However, in a plant system it is doubtful that this 
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signal peptide would b removed, thus its r ooval would have to b 
done as part of a purification procedure. In addition, the above 
constructions do not include the normal signal peptides "'for the plant 
seed storage protein genes to ensure correct targeting of the Bst 
S peptide. The following constructions illustrate a translational 
fusion, made with the Bst protein gene, which does not include the 
Bst signal peptide but dbes contain the slg;nal peptides from the 
plant seed storage protein genes. This is accomplished by using CPCR 
to amplify the part of the Bst gene which does not include the Bst 
10 signal peptide and cloning It into the expression cassettes described 
in Examples 2 and 4. The polypeptide resulting from expression from 
these fused gene is expected to Initially Include the seed storage 
signal peptides , which should be removed as a result of normal 
proteolytic cleavage to yield a Bst polypeptide which would be 
15 Identical to the naturally occurring peptide except for the presence 
of two to four additional amino acids at its N- terminus. These 
additional amino acids are needed to ensure that the signal peptide 
cleavage site is recognized (see Examples 2, 4 and 6). 

The appropriate Bst gene fragment can be obtained using CPCR 
20 amplification using the two primers shown in Chart 21Ca) which 
amplifies a 600 b.p. fragment referred to as BSTbvTT in Charts 21(b) 
and 22(a). This fragment is subjected to digestion by Ncol and then 
llgated into the seed storage protein gene cassettes described in 
Examples 2 and 4 to obtain clones, pUClSpvPpvSPBSTbvTTpvS (Chart 
25 21Cc)) and pUClSgmFgmSFBSrbvTTpvS (Chart 22(b)). The engineered 
expression cassettes contained within these clones can be removed by 
digestion with IfXndlll followed by transfer to vectors designed to be 
used with ^robacteriuis or microproj ectile gene transfer systems. 

EXAMPLE 11 

30 PLANT CONTAINING A CASSETTE FOR EXPRESSION OF BST 

A plant was produced which contained an expression cassette 
coding for the bovine somatotropin gene, BST. We began by removing 
the regulatory region of the phaseolin gene from clone pFhas 8.8 
SslII, J.L.Sllghtom, et al, Proc. Natl. Acad. Sci. U.S.A. 80:1897- 

35 1901 (1983), by restriction endonuclease digestion with figlll and the 
fragment purified following standardized techniques. Next, we used 
the transformed SV40 plasmid, pSVbGH-1, (described earlier) contain- 
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ing a complenjentary DNA (cDNA) of bovine somatatropln (Bst) and a Bst 
polyadenylation region; these are fused downs tr am of the SV40 
promoter. Plasmid pSVbGH-1 contains a single restriction site for 
B am HI which lies between the SV40 promoter and the Bst gene. Finally, 
5 pFhasBst Is made by digestion of pSVbGH-1 with BamHI followed by 
subsequent Incorporation of the purified promoter region at the 
restriction site. 

The region of the pFhasBst plasmid containing the Bst genes and 
the phaseolln regulatory region Is removed by digestion with Kpn l and 

10 inserted at the Kpn l site of A grobacteri^um binary vector pGA482G to 
make plasmid pCA482G/FhasBst. pGA482G/PhasBst is used to transform 
A grobacterium strain C58Z707. A.G. Hepburn, et al, J. Gen. Kicrobiol, 
131:2961-2969 (1985). Finally, the resultant Agrobacterium is used to 
infect cultured tobacco leaf pieces, from Which whole plants were 

15 derived. 

Transgenic tobacco plants containing the FhasBst gene are 
obtained through the Rl generation . Seeds from theses plants , R2 
seeds, are stained with Bst antibody for TEM visualization. Results 
show localization of Bst protein within the protein storage bodies. 
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CHART I 

Construction of pUClSpvFpS expression cassette 

5 

(a) Hindlll 



15 



35 



45 



10 5 ' - CCTAAGAAGCTT GGCACTCTGTGGTCTTTrGGTTC- 3 ' 

5' Oligomer primer 



Hcol 



5' -GTCAAGCCCATGG TTAGTAGAGTAGTATTGAATATG-3 
20 3' Oligomer primer 



(b) Hindlll Ncol 

25 I t 



pvp 
500 b.p. 



30 

(c) Ncol 



5 ' - CAATTGCCATGG TTGTATGAACTAAAATGCATGTAG - 3 ' 
5' Oligomer primer 



40 Hindlll 



5 ' -CAACTGAAGCTT GTTGAAACATCCCTGAAGTGTC- 3 ' 
3' Oligomer primer 



(d) Ncol Hindlll 

J ^ 

pvS 
1100 b.p. 
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CHART 1 fcontlnued") 
<e> Hlndlll Ncol Hlndlll 



5 



10 



20 



pvPpvS 
1600 b.p. 



(f) Hlndlll Ncol Hindlll 

15 * \ I I * 

pvp > pvs > 



pUClSpvPpvS 
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(a) 

10 



15 



20 



25 (b) 



(c) 

35 



40 



50 (d) 



CHART 0 , 

Construction of pUClSpvPpvSPpvTTpvS 



Hindi II 



5'-CCTAAGAAC?CTT GGCACTCTGTGGTCTTTTGGTTC - 3 ' 
5' Oligoiner primer 



Ncol 



5'-AATTAATCCATGG CCGGAGTGAAGTGGCAAATG- 3 ' 
3' Oligomer primer 



NcoI 



pvPpvSP 

30 580 b.p. 



Ncol 



5'-AATCTTCCATGG GCATTTGTGTACTGAATAAGTATG-3' 
5' Oligomer primer 



Hindi II 



45 5' -CAACTGAAGCTT GTTGAAACATCCCTGAAGTGTC 3 ' 

3' Oligomer primer 



Ncol 

I Hindi I I 



pvTTpvS 

55 1200 b.p. 
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5 (e) Hindlll 



10 



15 



20 



(f) Hindi! I 

* t 
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CHART 2 (continued) 



Ncol 



pvPpvS PpvTTpvS 



Ncol 



Hindlll 



Hindlll 



pvp --,.-> pvs > pvTT > pvs 

pUClSpvPpvS PpvTTpvS 
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CHART 3 

Construction of pUClSgmPpvS 

Hlndlll 
U 

5' -TACATGAAGCTT GATCCATGCCCTTCATTTGCCG • 3 ' 
5' Oligiacr primer 



5' -TCAAGCCATGG TTAGTATATCTTAAATTCTTTAATAC-3' 
3' Ollgoaer primer 



Ncol 



Hlndlll 



Ncol 



gap 
600 b.p. 



Hlndlll 



Hcol 



Hlndlll 



gaPpvS 
1700 b.p. 



Hindi II 



Ncol 



Hlndlll 



> pvS 




pUClSgmPpvS 
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CHART 4 



Construction of pUClSgmPgmSPpvTTpvS 



10 



Hindlll 



5 ' - TACATGAAGCTT GATCCATGCCCTTCATTTGCCG - 3 ' 
5'-01igmer 



15 



20 



Ncol 
l_ 

5' -TTTAAATCCATGG TTCCCAATACGCAATGCCAAATGAG - 3 
3' Oligmer 



25 



30 



33 



(b) 



Htndlll 



Ncol 



(c) Hindlll 



gmPgmSP 
690 b.p. 



Ncol 



gmPgmS PpvTTpvS 
1890 b.p. 



Hindlll 



40 <d) Hindlll 
* I 



45 



Ncol 



Hindlll 



gmp > gmSP > pvTX > pvs > 

pUClSgmPgmSPpvTTpvS 
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CHART S 

Nucleotide Sequence of pZelnlSk 

5 



GGATCCAAGGCACCTAACAACTAGCACAACATTACAACAAAATATTGGTGTATATGTGCC 
1 + + + + + + 60 

10 

TACAATAAGGTGATGAGTCATGGTGATGTGTAAAGAGGCATTACAAAGTTAGCTTCACAA 
61 + +- + + + + 120 

15 

GCGTATGAATTCATTGACAACCCTTGACATGTAAAGTTGATTCATATGTATAAGAAAGCT 
121 + + + + + + 180 



20 TAATGATCTATCTGTACATCCAAATCCATGTACTATGTTTCCACgTCATGCAACGCAACA 

181 + + + + + --+ 240 



TTCCAAAACCATGGATCATCTATAAATGGCTAGCTCCCACATATGAACTAGTCTCTATCA 
25 241 + + + + + + 300 



TCATCCAATCCAGATCAGCAAAGCGGCAGTGCGTAGAGAGGATCGTCGAACAGAACAGCA 

301 + + + *,-.-+ + 360 

30 M 

TGAAGATGGTCATCGTTCTCGTCGTGTGCCTGGCTCTGTCAGCTGCCAGCGCCTCTGCAA 

361 + + + + + + 420 

KMVIVLVVCLALSAASASAK 

35 

TGCAGATGCCCTGCCCCTGCGCGGGGCTGCAGGGCTTGTACGGCGCXGGCGCCGGCCTGA 

421 + + + + + + 480 

QHPCPCAGLQGLYGAGAGLT 

40 

CGACGATGATGGGCGGCGGCGGGCTGTACCCCTACGCGGAGTACCTGAGGCAGCCGCAGT 

481 + + + + + + 540 

TMMGAGGLYPYAEYLRQPQC 

45 

GCAGCCCGCTGGCGGCGGCGCCCTACTACGCCGGGTGTGGGCAGACGAGCGCCATGTACC 

541 + + + + + + 600 

S PLAAAPYYAGCGQTSAMYQ 

50 

AGCCGCTCCGGCAACAGTGCTGCCAGCAGCAGATGAGGATGATGGACGTGCAGTCCGTCG 

601 + + + + + + 660 

PLRQQCCQQQMRMHDVQSVA 

55 
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CHART 5 f continued^ 



CGCAGCAGCTGCAGATGATGATGCAGCTTGAGCGTGCCGCTGCCGCCAGCAGCAGCCTGT 

5 661 + 4- + + + + 720 

QQLQMMKQLERAAAAS S SLY. 



ACGAGCCAGCTCTGATGCAGCAGCAGCAGCAGCTGCTGGCACTCCAGGGTCTCAACCCCA 

10 721 + +-- + + + + 780 

EPALMQ QQQQLLALQGLNPM 



TGGCCATGATGATGGCGCAGAACATGCCGGCCATGGGTGGACTCTACCAGTACCAGTACC 

15 781 + + + + + + 840 

AMMHAQNHPAMGGLYQYQYQ 



AGCTGCCCAGCTACCGCACCAACCCCTGTGGCGTCTCCGCTGCCATTCCGCCCTACTACT 

20 841 + + + + + ---+ 900 

LPSYRTNPCGVSAAIPPYY* 



GATTCATGATATTTGGGAAATCTCCTCTATCCATCCCTCTCTATCTATATATGTAATAAT 
25 901 + -f + + + + 960 



GCAGTAAGACGACACACATTATCATGTGTGGTATGACCAATAATATATGCATCATAATAA 
961 + + + + + +1020 

30 

AGTTTTGGTrrTAAAGAATTATCGGACGCTTGATATCTATGATGCTGGATAAATCAAAAC 
1021 + + + + + +1080 

35 

TTCTCATATAAATTGTAAATATTTCAAAATCTCTATTTAGGCTCCAATGGAGAGCATATG 
1081 + + + + + +1140 



40 GGTAGAGTAGTATATATGCTTGAAATACTAACAACTAGCAAAGTGCGGGCACGTTGCTAC 

1141 + + + + + +1200 



ATGCTCATTTATGCTCGAGCATGGAGTATAAAACATAAAGATATATATGTTCTATTGGCT 
45 1201 + + + + + +1260 



TGGTAAACGTGGATATAGGTTTAAAGCCAACAACTATGGTTCGAATCCTCATTTATATAT 
1261 + + + + + +1320 

50 

AATCTATAATTTTAGCGTTTTTACTATTTAAATTTTGGGAGTAAGACGAGAAGGTGCAGG 
1321 + + + + + +1380 

55 
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CHART 5 ^continued) 



5 GACTTACTCrrGACAACAAATCAATTCGGAAGACACAACAGTAAAAGATAAAATrXCCGA 

1381 + + + + + +1440 



CGGCCAAATGCTTATGCTTCGACGACACCTATCCCTCGGCTCGTAATGGGTCGGAAATAA 
10 1441 + +--■ + + + +1500 



GGTTATTTCTGACGGCGGCCGTCGGAAATAGCTTATGTTCCGTACGGCCCGAAAGGGCCA 
1501 + + + + + +1560 



15 



20 



TCGATCATTTACCTAAGATAACGATCGTAACGGCCGATGGTGGGTTCCACAGTTCTTATG 
1561 + + + + + +1620 



TCGACGCACTGCTAACCATCGATTATTCGATTCTATGTCTCCGAGGCTTATAGTATTGCG 
1621 + +--- + + + +1680 

25 GGCAAACTTATTTGGCArrATGAACTTAACGGAACCATTACATATATATACATATTGATT 

1681 + + + + -...+ +1740 



CATATAC 
30 1741 1747 



BNSDOCID: iMO 9113993A1J_> 



wo 91/13993 



PCr/US91/00887 



-31- 



CHART 6 



Construction of pUClSzmPzmS 

5 



(a) * 



p > SP > coding > TT > S > 

10 

pZeln I5k 



15 (b) Sail 

U 

5' -ATAGCTGTGGAC GATCCAAGGCATCTAACAACT-3 ' 
5' oligomer primer 

20 

Ncol 

u 

25 5' -GTCGTACCATGG TGTTCTGTTCGACGATCCTCTCTA-3' 

3' oligomer primer 

30 

(c) Sail Ncol 

J L 

zmP 

35 350 b.p. 



(d) Ncol 

5 ' - CATAGCCATGG TATTTGGGAAATCTCCTTC - 3 ' 
5' oligomer primer 

45 

Sail 



5' - TAGACTGTCGAC AAGAACTGTGGAACCCACCATCG-3' 

50 

3' oligomer primer 



55 
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CHART 6 f contimi^t-j ^.,-^ 



Ncol 



20 

(e) Sail 



•at 



zmS 
700 b.p. 



zmPzmS 
1050 b.p. 



Neol 
i 



> zmS ... 

pUClSzinPEmS 



Sail 



15 Sail 



Sail 

^1 * 
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CHART 7 

Construction of pTJClSzmFzmSFzmTTzmS 



(a) Sail 

X 



10 5' -ATAGCTGTCGAC GATCCAAGGCATCTAACAACT-3' 

5' oligomer primer 



Ncol 



5 ' • AATTACCCATGG GCAGGGCATCTGCATTGCAGAG - 3 ' 
20 3* oligomer primer 

25 (b) Sail Ncol 



zmFzmSP 
400 b.p. 



(c) Ncol 



35 5' -AATCATCCATGG CCCTACTACTGATTCATGATATTTG-3' 

5' oligomer primer 

^0 Sail 



5 ' - TAGACTGTCGAC AAGAACTGTGGAACCCACCATCG - 3 • 
3' oligomer primer 



(d) Ncol Sail 

J L 

zmTTzmS 
720 b.p. 
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CHART 7 ^^-nn tlnu^H^r,^ 

5 (e) Sail 



10 



Bf«DOCID: <WO 9113993A1J_> 



Sail 

—J L 



zmPzmSPzmTTzmS 
1120 b.p. 



Sail 

* I ^ Sail 

^'"I' -"-> zmSP -.-> zLtt zmS ---J ^* 

pUCl SzmPzmSPzmTTzmS 



wo 91/13993 PCr/US91/00887 

-35- 

CHART 8 



Nucleotide Sequence of Brazil Nut 
5 Sulfur-Rich Seed Storage Protein 



TCCACCACTGCTCTGTATCACATATACCAACCCACGCTTATACCCAGAATCACCATGGCG 

10 1 + + + + + 60 

M A 



AAGATTTCAGTTGCGGCAGCAGCGCTCCTTGTCCTCATGGCCCTCGGCCACGCCACCGCC 

15 61 + + + + + + 120 

KIS VAAAALLVLMALGHATA 



TTCCGGGCCACCGTCACCACCACAGTGGTGGAGGAGGAGAACCAGGAGGAGTGTCGCGAG 

20 121 + + + + + + 180 

FRA TVTTTVVEEENQEECRE 

CAGATGCAGAGACAGCAGATGCTCAGCCACTGCCGGATGTACATGAGACAGCAGATGGAG 

25 181 + + + + + + 240 

QMQRQQ MLSHCRMYMRQQME 

GAGAGCCCGTACCAGACCATGCCCAGGCGGGGAATGGAGCCGCACATGAGCGAGTGCTGC 

30 241 + + + + + + 300 

ESP YQTHFRRGHEFHHSECC 

GAGCAGCTGGAGGGGATGGACGAGAGCTGCAGATGCGAAGGCTTAAGGATGATGATGATG 

35 301 + + + + + + 360 

EQL EGMDESCRCEGLRHKHH 

AGGATGCAACAGGAGGAGATGCAACCCCGAGGGGAGCAGATGCGAAGGATGATGAGGCTG 

40 361 + + + + +- + 420 

RHQ QEEHQPRGEQHRRHHRL 

GCCGAGAATATCCCTTCCCGCTGCAACCTCAGTCCCATGAGATGCCCCATGGGTGGCTCC 

45 421 + + + + + + 480 

AE NIPSRCNLSPHRCPHGGS 



ATTGCCGGGTTCTGAATCTGCCACTAGCCAGTGCTGTAAATGTTAATAAGGCTCTCACAA 

50 481 + + + + + + 540 

I A G F * 
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CHART 8 (continuation^ 



5 ACTAGCTGTTrGTTGGGTTrTGGCCGGAGACTAGGGTGTGGGATTAATAATAATAGCACA. 

541 + + + + + + 600 



CTATCGTGTGTTCTCAGCTTC 
10 601 + +i 621 
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CHART 9 



Construction o£ Expression Cassettes Using the Brazil Nut 
Sulfur-Rich Seed Storage Protein Gene (B2) 



(a) Ncol 
10 ' !_ 



X5 



5 ' - AATTCCACCATGG CGAAGATTTCAGTTG - 3 ' 
5* oligomer primer 

Ncol 



5 ' - AATTCAACCATGG CATGGGGGCATCTCATGGGACTG - 3 ' 
20 3' oligomer primer 



(b) Ncol Ncol 
25 1 1 



30 



35 



BZ 
420 b.p. 



(c)(1) Hindlll Ncol Ncol Hindlll 

* \ I I \ * 

pvP > BZ > pvTT > pvS > 

pUClSpvPBZpvTTpvS 



40 (c)(ii) Hindlll Ncol Ncol Hindlll 

* 1 ! I l_* 

gmP > BZ > pvTT > pvS > 

pUCl 8 gmPBZpvTTpvS 

45 

(c)(iii) Sail Ncol ■ Ncol ' Hindlll 

* \ \ I I * 

50 zmP > BZ > zmTT ----> zmS > 

pUClSzmPBZzmXTzmS 
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CHART 10 



(a) Nucleotide Sequence of a Synthetic 9 kD subunlt: of a Brazil 
5 Nut SuLfur-Rlch Seed Storage Protein (BZ-1) 



ATGGAGCCGCACATGAGCGAGTGCTGCGAGCAGCTGGAGGGGATGGACGAGAGCTGCAGA 

10 1 + +--: + + + + 60 

HEP HHSECCEQLEGKDESCR 



15 TGCGAAGGCTTAAGGATGATGATGATGAGGATGCAACAGGAGGAGATGCAACCCCGAGGG 

61 + + + + + + 120 

CEG LRHHMHRMQQEEHQFRG 

20 GAGCAGATGCGAAGGATGATGAGG 

121 + + lUU 

EQKRRHKR 



25 



30 



(b) Hucleotlde Sequence of a Synthetic Brazil Nut Sulfur- 

Rich Seed StorageFroteln Complete 9 kD Gene (BZ-2) 



CCCAGGCGGGGAATGGAGCCGCACATGAGCGAGTGCTGCGAGCAGCTGGAGGGGATGGAC 

1 + + + + + + 60 

FRR GKEFHHSECCEQLEGHD 

35 



GAGAGCTGCAGATGCGAAGGCTTAAGGATGATGATGATGAGGATGCAACAGGAGGAGATG 
61 + + +.........+ + + 120 

40 ESC RCEGLRHMMMRMQQEEM 

CAACCCCGAGGGGAGCAGATGCGAAGGATGATGAGGCTGGCCGAGAATATCCCTTCCCGC 

121 + + + + + + 180 

45 QPR GEQMRRKMRLAENIPSR 

TGCAACCTCAGTCCCATGAGATGCCCCATG 

181 + + + 210 

50 CNLSPMRCPM 
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CHART 10 (contlnued> 



(c) Nucleotide Sequence of a Synthetic Brazil Nut 12 kD Subunit 
5 of the Sulfur-Rich Seed Storage Protein Gene (BZ-3> 



10 CAGCAGATGCTCAGCCACTGCCGGATGTACATGAGACAGCAGATGGAGGAGAGCCCGTAC 

1 + + -.-+ + + + 60 

QQH LSHCRMYMRQQMEESPY 

15 

CAGACCATGCCCAGGCCGGGAATGGAGCCGCACATGAGCGAGTGCTGCGACCAGCTGGAG 

61 + + + + + + 120 

QTM PRRGMEPHMSECCEQLE 

20 

GGGATGGACGAGAGCTGCAGATGCGAAGGCTTAAGGATGATGATGATGAGGATGCAACAG 

121 + +-- + + + 180 

GHDESC RCEGLRHHHHRHQQ 

25 

GAGGAGATGCAACCCCGAGGGGAGCAGATGCGAAGGATGATGAGGCTGGCCGAGAATATC 

181 + + + + + + 240 

EEH QFRGEQHRRHHRLAENI 

30 

CCTTCCCGCTGCAACCTCAGTCCCATGAGATGCCCCATG 

241 + + + 279 

PSRCNLSPHRCPH 



35 
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CHART 11 



(a) Ncol 

5 i_ 



5' -AATTATCCATGG ATGGAGCCGCACATGAGCGAGTG-3' 
5' oligomer primer 

Ncol 



5' -AATTCTCCATGG CATCATCCTTCGCATCTGCTC-3 ' 
15 3' oligomer primer 



Cb) Ncol 
20 I 



5 ' - AATTCACCATGG CCAGGCGGGGAATGGAGCCGG- 3 ' 
5' oligomer primer 

Ncol 



5' -AATTCACCATGG CATGGGGCATCTCATGGGACTG - 3 ' 
30 3* oligomer primer 

(c) Ncol 

5 ' -ATTCACCATGG CAGCAGATGCTCAGCCACTGCC-3 ' 
5' oligomer primer 



Ncol 

u 

5 ' - AATTCACCATGG CATGGGGCATCTCATGGGACTG - 3 ' 
3' oligomer primer 
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CHART 17 



Phaseolin Expression Cassettes Using 
Brazil Nut Gene Fragments 



in Hindlll Ncol Ncol Hindlll 

* 1 J -J . I* 

pvP > pvSP > BZl > pvTT > pvS > 

pUC18 pvPpvS PBZ - IpvTTpvS 

15 

(b) Hindlll Kcol Ncol Hindlll 

* — L ! 1 . I 

----> PvSP > BZ2 > pvTT > pvS > 



pUClflpvPpvSPBZ- 2pvTTpvS 



25 (c) ^Hindlll Ncol Ncol Hindlll 



pvP > pvSP > BZ3 > pvTT > pvS > 

pUCl8pvPpvSPBZ-3pvTTpvS 



I * 
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CHART 13 



Glycine Expression Cassettes Using 
Brazil Nut Gene Fragments 



Ca) Hindlll Ncol Ncol Hindlll 

— J-r U \ L_* 

gmP > gmSP > BZI > pvTT > pvS > 

pUClSgmPgniSPBZ-lpvTTpvS 



15 



20 



(b) Hindlll Ncol Ncol Hindlll 



gmP > gmSP > BZ2 > pvTT > pvS > 

pUClSgmPgjiiSPBZ- 2pvTTpvS 



25 (c) ^Hlndlll Ncol Ncol Hindlll 

* 



gmP .---> gmSP ----> BZ-3 ---> pvTT -■-> pvS ----> 
pUClSgmPgmSPBZ- 3pvTTpvS 
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CHART 14 



Zein Expression Cassectes Using 
Brazil Nuc Gene Fragments 



(a) Sail Ncol Ncol Sail 

10 *_J _1 I [_* 

zmp > znsp > B2-1 > zmTT > zmS > 



15 



20 



pUClSzmPzmSPBZ-lzmTTzmS 



(b) Sail Ncol Ncol Sail 

* \ 1 \ \ * 

zmP > zmSP > BZ-2 > zmTT > zmS > 



pUC 1 8 zmPzmS F B Z - 2 zmTT zmS 



25 (c) Sail Ncol Ncol Sail 

*-l 1 t U-* 

zmP > ebbSP > BZ-3 > zmTT > zmS > 



30 



pUClBzmFzmSPBz - 32mTTzmS 
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CHART 15 

Fhaseolln Expression CasseCCes Using B5T Gene 
Ncol 



5' -AATTCAACCATGG GCTGCAGGCCCCCGGACCTCCC-3' 
5' oligomer primer 
Ncol 



5' -AAATTCAAAGCTT GCATCCCCAGCATGCCTGCTATTG-3' 
3' oligomer primer 

Hindlll Ncol Hindlll 

J I L 

pvPbvSPBSTbvTTbvS 

Hindlll Ncol Hindlll 

*_l I U-* 

pvP ---> bvSP ---> BST ---> bvTT > bvS ---> 

pUGl&pvPbvSPBSTbvTTbvS 
CHART 16 

Glycine Expression Cassettes Using BST Gene 
Hindlll Ncol Hlndlll 

J I L 

gmPbvS PBSTbvTTbvS 

Hindlll Ncol Hindlll 

* I I \ * 

gmP ---> bvSP ---> BST ---> bvTT ---> bvS ---> 

pUC 18 gmPbvS PBSTbvTTbvS 
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CHART 17 

Fhaseolln Expression Cass&ttes Using BST Gene 
Ncol 



5' -AAATTACCATGG GCATCCCCAGCATGCCTGCTATTG-3' 
3' oligomer primer 

Ncol Ncol 
J L 

bvSPBSTbvTTbvS 

Hindlll Ncol Ncol Hindlll 

*-J \ \ U* 

pvp > bvSP — > BST > bvTT — > bvS — > pvS > 

pUClSpvPbvSPBSTbvTTbvSpvS 
CHART 

Glycine Expression Cassettes Using BST Gene 
Ncol Ncol 



bvS PBsTbvTTb vS 



Hindi II Kcol Ncol Hindi II 

* I I \ \ 

gmP bvSP BST --> bvTT --> bvs --> pvS - -> 

pUClSgmPbvSPBSTbvTTbvSpvS 
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CHART 19 

Fhaseolln Expression Cassetces Using BST Gene 
Ncol 



5* - AATTCACCATGG GGGGGAGGGGCAAACAACAGATGG-3 
3' oligomer primer 

Ncol Ncol 



pvSPBSTbvTT 

Hindlll Ncol Ncol Hindlll 

* I I I \ * 

pvP > bvSP > BST > bvTT > pvS > 

pUClSpvPbvSPBSTbvTTpvS 

Glycine Expression Cassettes Using BSC Gene 
Ncol Ncol 



pvSPBSTbvTT 

Hindlll Ncol Ncol Hindlll 

* 1 1 \ \ * 

gmP ---> bvSP---> BST ---> bvTT ---> pvS ---> 

pUC 1 SgmPbvS PBSTbvTTpvS 
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10 



15 



CHART 21 

Phaseolin Expression Cassettes Using BST Gene 



5 

(a) Ncol 



5 ' - MTCATCCATGG TTCCCAGCCATGTCCTTGTCCGGCC - 3 ' 
5' oligomer primer 

Ncol 



5' -AATCACCATGG GGGGGAGGGGCAAACAACAGATGG - 3 ' 
3' oligoiner primer 

20 

Cb) Ncol Ncol 



BSTbvTT 

25 

(c) Hlndlll Ncol Ncol Htndlll 

*— J \ I i * 

30 pvP ---> pvSP ---> BST ---> bvTT ---> pvS ---> 

pUC 1 8 p vPp vS PBS TbvTTp vS 
35 CHART 22 

Glycine Expression Cassettes Using BST Gene 

40 

(a) Ncol Ncol 



BSTbvTT 

45 

(b) Hindlll Ncol Ncol Hlndlll 

*-J I \ I * 

50 gmP ---> gmSP ---> BST > bvTT ---> pvS — -> 

pUCl 8 gmPgrnS PBSTbvTTpvS 
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ClAIMS 

We claim: 

1. A seed specific expression cassette having gene regulatory 
5 elements comprising: 

a) a promoter selected from the group consisting of phaseolin 
promoter, a' -subunlt of ^-conglyclnln promoter, and ^-zeln promoter; 

b) a translation initiation signal selected from the group 
consisting of phaseolin translation initiation signal, o'-subunit of 

10 ^-conglyctntn translation initiation signal, ^-zein translation 

initiation signal, and animal gene translation initiation signal; 

c) a gene selected from the group consisting of an animal gene 
and a gene from brazil nut 2S storage protein; 

d) a poly-A signal selected from the group consisting of a 
15 phaseolin poly-A signal and an animal gene poly-A signal; 

wherein the promoter is upstream and operably linked to the Initia- 
tion signal, the initiation signal is upstream and operably lixiked to 
the gene, and the gene is upstream and operably linked to the polyCA) 
signal; and provided that when the gene regulatory element is ampli- 

20 fled from genomic or synthetic DNA the regulatory region is flanked 
by a restriction enzyme recognition site selected from the group 
consisting of Ncol, Hlndlll, and Sail; and further provided when the 
translation initiation signal is from phaseolin the promoter is from 
phaseolin, and when the translation initiation signal is from the o* - 

25 subunit of ^-conglycinin the promoter is from the a'-subunit of ^- 
conglyclnin, and when the gene is from brazil nut the promoter and 
translation initiation signal are derived from the same genome. 

2. A seed specific expression cassette according to Claim 1 wherein 
30 the promoter is from phaseolin, the remaining regions are from bovine 

somatotropin, and further comprising a SV40 promoter region upstream 
and operably linked to the phaseolin region. 

3. A seed specific expression cassette according to Claim 1 wherein 
35 the gene Is an animal gene from bovine somatotropin and includes the 

translation initiation signal. 

4. A seed specific expression cassette according to Claim 3 wherein 
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the promoter Is from phaseoXint the remaining elements are froa 
bovine somatotropin, the regions linking both the promoter and the 
poly-A elements to the plasnld each code for Hindlll, and the regions 
linking the promoter to the translation Initiation signal codes for 
5 Ncol . 



5. A seed specific expression cassette according to Claim 3 wherein 
the promoter is from the a'-subunit of ^-conglycinin, the remaining 
elements are from bovine somatotropin, the regions linking both the 

10 promoter and the poly-A elements to the plasmid each code for 

Hindlll, and the region linking the promoter to the translation 
initiation signal codes for Ncol. 

6. A seed specific expression cassette according to Claim 3 wherein 
15 the promoter and the poly-A signal are from phaseoltn, the remaining 

elements are from bovine somatotropin, the region linking both the 
promoter and the poly-A elements to the plasmid each code for 
Hindlll, and the region linking the promoter to the translation 
initiation signal and the gene to the poly-A signal code for Hcol. 

20 

7. A seed specific expression cassette according to Claim 3 wherein 
the promoter and the poly-A signal are from the a'-siibunlt of ^-con- 
glycinin, the remaining elements are from bovine somatotropin, the 
region linking both the promoter and the poly-A elements to the 

25 plasmid each code for Hindlll, and the region linking the promoter to 
the translation initiation signal and the gene to the poly-A signal 
code for Ncol. 



8. A seed specific expression cassette according to Claim 3 wherein 
30 the promoter, translation initiation signal, and poly-A signal are 

from phaseolln, the region linking both the promoter and the poly-A 

• elements to the plasmid each code for Hindlll, and the region linking 
the translation initiation signal to the gene and the gene to the 

* poly-A signal code for Ncol. 
33 

9. A seed specific expression cassette according to Claim 3 wherein 
the promoter is from phaseolln, the translation initiation signal and 
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gene are from bovine somatotropin, and having a flrst^ poly-A. signal 
from bovine sotnacotropln upstream and operably linked to a second 
poly-A signal from phaseolln, the region linking both the promoter 
and the second poly-A elements to Che plasmld each coding for 
5 Hlndlll , and the region linking the promoter to the translation 

initiation signal and the first poly-A signal Co the second poly-A 
signal coding for Ncol. 

10. A seed specific expression cassette according to Claim 3 wherein 
10 the promoter Is from the a' -subunlc of ^•conglycinin, the translation 

initiation signal and gene are from bovine somatrtropin, and having a 
first poly-A signal from bovine somatotropin upstream and operably 
linked to a second poly-A signal from phaseolln, the region linking 
both the promoter and the second poly-A element to the plasmld each 
15 coding for Hindlll, and the region linking the promoter to the 

translation initiation signal and the first poly-A signal to the 
second poly-A signal coding for ticoX. 

11. A seed specific expression cassette according to Claim 1 com- 
20 prising: 

a) a promoter selected from the group consisting of phaseolln 
promoter, a'-subunlt of ^-conglycinln promoter, and ^-zeln 15K 
promoter. 

b) a translation initiation signal selected from the group 

25 consisting of phaseolln translation initiation signal, a'-subunit of 
^-conglycinin translation initiation signal, and ^-zein 15K transla* 
tion initiation signal; 

c) a gene selected from the group consisting of brazil nut gene 
and brazil nut gene subunit; 

30 d) a translation termination signal selected from the group 

consisting of phaseolln translation termination signal and ^-zeln 15K 
translation termination signal; 

e) a poly-A signal selected from the group consisting of phaseo- 
lln poly-A signal and ^-zein 15K poly-A signal; 

35 wherein the promoter is upstream and operably linked to the initia- 
tion signal, the initiation signal is upstream and operably linked to 
the gene, the gene is upstream and operably linked to the translation 
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termination signal, and the translation termination signal Is 
upstream and operably linked to the poly-A signal, and further 
provided that the gene regulatory element is flanked by a restriction 
enzyme recognition site selected from the gr up consisting of Ncol, 
5 Hindlll, and Sail; and further provided that the promoter and 
translation initiation signals are derived from the same genome, 

12. A seed specific expression cassette according to Claim 11 
wherein the gene is BZ. 

10 

13. A. seed specific expression cassette according to Claim 12 
wherein the promoter, translation termination signal, and poly-A 
signal are from phaseolin, the region 15 ;':^ing both the promoter and 
poly-A elements to the plasmid each code for Hindlll, and the region 

15 linking the promoter to the gene and the gene to the termination 
signal code for Ncol. 

14. A seed specific expression cassette according to Claim 12 
wherein the promoter is from the a'-subunit of ^ - conglyc inin , the 

20 translation termination signal and poly-A signal are from phaseolin, 
the region linking both the promoter and poly-A elements to the 
plasmid each code for Hindlll, and the region linking the promoter to 
the gene and the gene to the termination signal code for Ncol. 

25 IS. A seed specific expression cassette according to Claim 12 

wherein the promoter, translation termination signal, and poly-A 
signal are from ^-zeln 15K, the region linking both the promoter and 
poly-A elements to the plasmid each code for Sail, and the regions 
linking the promoter to the gene and the gene to the termination 

30 signal code for Ncol. 

16. A seed specific expression cassette according to Claim 11 
wherein the gene is selected from the group consisting of BZ-1, BZ-2, 
and BZ-3. 

35 

17. A seed specific expression cassette according to Claim 16 
wherein the remaining elements are from phaseolin, the region linking 
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b Ch the pronoter and poly-A el ments to Che plasntld ach code for 
Htndlll, and the region linking the translati n initiation signal to 
the gene and the gene to the t rmination signal code for Ncol. 

5 18. A seed specific expression cassecce according to Claim 16 
wherein the remaining elements are from ^-zein 15K, the region 
linking both the promoter and poly-A elements to the plasmid each 
code for Sail, and the region linking the translation initiation 
signal to the gene and the gene to the termination signal code for 
10 Ncol . 

19. A cell comprising a seed specific expression cassette as claimed 
in Claim 1. 

15 20. A seed or plant comprising cells as claimed in Claim 19. 
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